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Abstract —In this paper, the problem of optimal maximum 
likelihood detection in a single user single-input multiple-output 
(SIMO) channel with phase noise at the receiver is considered. 
The optimal detection rules under training are derived for 
two operation modes, namely when the phase increments are 
fully correlated among the M receiver antennas (synchronous 
operation) and when they are independent (non-synchronous 
operation). The phase noise increments are parameterized by a 
very general distribution, which includes the Wiener phase noise 
model as a special case. It is proven that phase noise creates 
a symbol-error-rate (SER) floor for both operation modes. In 
the synchronous operation this error floor is independent of M, 
while it goes to zero exponentially with M in the non-synchronous 
operation. 

I. Introduction 

The demand for wireless data traffic over cellular networks 
is expected to increase rapidly in the years to come. The 
deployment of base stations with an excess of base station 
(BS) antennas, M, m, termed as Massive multiple-input 
multiple-output (MIMO) systems, is a technology that offers 
unprecedented gains in energy and spectral efficiency 12, 
0. In particular, it has been shown that one can reduce the 
required radiated power by 1.5 dB to achieve a desired fixed 
information rate for every doubling of the base station (BS) 
antennas, when imperfect channel state information (CSI) is 
available and linear processing techniques are used 0, 0- 

The gains offered by Massive MIMO can be achieved in 
the uplink by coherently combining the received signals using 
estimated channel impulse responses. A hardware impairment 
that hinders coherent communication is phase noise. Phase 
noise is introduced in communication systems by imperfec¬ 
tions in the circuitry of the local oscillators that are used to 
convert the baseband signals to the passband and vice versa 
0. These imperfections are caused by the limited hardware 
complexity of practical circuit implementations. Since Massive 
MIMO relies on coherent combining, it is essential to study the 
effect of phase noise on the information rate of these systems. 

Recent work has shown that in the phase noise impaired 
Massive MIMO uplink with imperfect CSI and linear receive 
processing, one can still reduce the radiated power by 1.5 dB 
for every doubling of M and achieve a fixed desired per user 
information rate 0, 0. Further, it has been observed by 
various authors that using independent phase noise sources 
at the uplink receiver can provide improved performance. 
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In 0 the authors show that the error vector magnitude 
(EVM) at the direction of the main lobe in beamforming 
is smaller with independent phase noise sources. In m a 
frequency selective Massive MIMO uplink with imperfect CSI 
and linear receive processing is considered. Lower bounds 
on the sum-rate performance for the cases of a common or 
many independent local oscillators are compared and it is 
shown that the sum-rate performance is better in the case of 
independent oscillators. Also, in CD the authors find that 
the second order capacity expansion in the high signal-to- 
noise ratio (SNR) regime of a single-input multiple-output 
(SIMO) channel is higher for the case of separate oscillators. 
Furthermore, 02 shows that the phase noise variance can 
be allowed to increase logarithmically with M in multi¬ 
cell systems with non-synchronous operation without losing 
much of the performance, while this is not possible in the 
synchronous operation. Finally, in lfl3l the authors observe the 
same phenomenon for other types of hardware impairments. 

In this work we consider the problem of optimal maximum 
likelihood (ML) detection in a single user SIMO channel 
with phase noise impairments at the receiver. Communication 
is done in two time slots: in the first time slot a training 
symbol is transmitted and in the second time slot a data 
symbol is sent. The receiver then calculates the ML detection 
rule using the information that has been acquired during 
training and data transmission. The ML detection rule is 
given explicitly for the cases of one common and multiple 
independent oscillators. The phase noise increments attain a 
very general parametrization, which enables the analysis of 
various phase noise models. Contrary to the no-phase-noise 
case, the high SNR analysis shows that the symbol-error-rate 
(SER) performance exhibits an error floor in both operations. 
For the synchronous operation, this error floor is independent 
of the number of receive antennas, M. However, for the non- 
synchronous operation, an upper bound on the SER shows that 
the error-floor is reduced to zero exponentially with M. 

II. System Model 

We consider a single antenna user that transmits information 
symbols to a BS array of M antenna elements. The system is 
impaired by phase noise at the BS but not at the user terminal. 
Further, no fading is assumed. This assumption enables us to 
focus on the fundamental effects of phase noise. The combined 
effect of phase noise and channel fading will be considered in 
future work. Communication is scheduled in two time slots, 


each of duration T s seconds, where T s is the symbol interval. 
During the first time slot the user transmits a pilot symbol 
so that the receiver can get an initial phase estimate. The 
baseband equivalent representation of the received signal, x m , 
at the m-th BS antenna at the first channel use is given by 

Xm = \Zpt7 T , (1) 

where w m is the m-th component of the additive white 
Gaussian noise (AWGN) column vector w ~ Ac (0, Im) and 
is independent of 9 m . The known positive scalar p corresponds 
to the SNR measured at each received antenna. The phase 9 m 
is an unknown initial phase uniformly distributed in [—7r,7r). 
In this work we consider two distinct operations. In the 
non-synchronous operation, the phases 0 rn are independent 
of each other. This corresponds to an operation where each 
antenna uses a separate oscillator for the downconversion 
of the bandpass signal to the baseband. In the synchronous 
operation we assume that 9\ = • • • = 6m = 6, which 
corresponds to an operation where a common oscillator is used 
for the downconversion of the received passband signal to the 
baseband. 

During the second time slot the user transmits a data sym¬ 
bol, s, from a constellation, S, with E[s] = 0 and E[|s| 2 ] = 1. 
The received signal at the m-th BS antenna is then given by 

y m = s + z m , (2) 

where z m is the m-th component of the AWGN column vector 
2 ~ Ac (0, Im) and the vector [w T z T ] is jointly Gaussian 
Ac (0, / 2 m)- Also 0 m is the random phase noise increment, 
independent of 9 m , the information symbol, s, and the AWGN 
noise, z m . Since we have 0 m G [—7r,7r), the probability 
density function (pdf), p$ M (0 m ), of the phase noise increment 
0 m can be expressed by its Fourier expansion 041 . IT5l . In 
this work we consider a general model for p$ M (0 m ), i.e. 

P$m (0m) = + 2 ^ cos (IfimJj , (3) 

where a m} i, l = 0,1,2,..., are real and known constants. 
The Fourier expansion in ([3]) can model any continuous, 
differentiable, unimodal, even and zero mean pdf. Later in 
the paper we will select a specific pdf, namely, the circular 
normal distribution, however, most of the results presented 
here are valid for any choice of p$ m (0 m ) that has an ex¬ 
pansion as in For the synchronous operation the phase 
noise increments 0 m are assumed to be identical among the 
antennas, whereas for the non-synchronous operation they are 
assumed independent. Finally, the system model for the non- 
synchronous operation is given in matrix vector form by 

x = t/p® 1 + w (4) 

y = Vp & ® s1 + ( 5 ) 

where 1 is the all-one column vector of size M, © = 
diag {e-?® 1 ,..., e 30M } and $ = diag {e^ 1 ,..., e^ M }. The 

The circular normal distribution is often used in literature to model the 
phase disturbance at the output of a local oscillator (§). 


system model for the synchronous operation is trivially derived 
from ® and ® by setting © = e :l °I m and $ = e J ^I m- 

III. Maximum Likelihood Detection 

In this section we derive the likelihood function of the 
received vectors x and y given the transmitted symbol, s, 
p(x,y\s), for both operations. Then the ML detection rule is 
given by 

s = argmaxp(ai, y\s). (6) 


Proposition 1: The pdf of the received vectors (at, y) given 
a symbol s for the non-synchronous operation is given by 

M / oo \ 

p(x,y\s) = A ( /3 m ,o + 2^/? mji cos((Cm) j (7) 

m= 1 \ 1=1 ) 

where 

A £ exp (—IMI 2 - ||y|| 2 - P M{ 1 + |s| 2 )) /tt 2M , (8) 
Pm,l = Otm,lIl{2y/p\s*y m \)Ii(2y/p\x m \), (9) 

Cm = arg (y m ) - arg(a: m ) - arg(s). (10) 


/;(•) is the /-th order modified Bessel function of first kind 
m and || • || is the Euclidean norm. 

Proof: Define the vectors 9 = \0 \,..., 0m} 1 and <f> = 
..., 4>m\ T ■ The likelihood function is given by 


p{x, y\s) = Jj p(x, y\s, 9 ,0)p(0, 4>\s)d9d(f) 
= JJ p(x\9)p(y\s, 0,0)p(0)p(0)d0d0 


= 11 I P( x m\0 m )p(9 m ) /p(y m |s,0 m ,0 m )p(^. ro )d0 m d 9 

mi 


M 


=Xm(ym\0m,s) 


(ID 


where (a) follows from the fact that conditioned on 0, 0 and s, 
the vectors x and y are independent and (b) is a consequence 
of the independence of the components in 0, 0, w and 2 . The 
channel probability law p(y m \s, 9 m , <j> m ) can be expressed as 

P(,ym\s , 0m> 0m) — exp 
7T 

e -\Vm \ 2 - p\s \ 2 

=-exp {2sfp\s*y m \cos (0 m + 0m-arg{s*y m })) 

7r 

e -\Vm\ 2 ~p\s \ 2 ( 

= ---I 7o(2 v / p|s*t/m|) (12) 

OO 

+2 Ii{2y/p\s*ym\) cos (7 (0 m + 6> m - arg (s*y m ))) 

i=i 

where the last step follows from the Jacobi-Anger formula 



Vm ~ \fpe 




e“ COB ^ =/„(«)+ 2^/ i (a)cos(//3). (13) 

1 = 1 







Then, the integral T m (y rn \O m , s) is 

/ 7r 

p(jjm\s ? 0m)p(0m)d0ri 

-7T 

^m,0-^02/m|) 


0 -|ym| 2 -p | s | 2 


+2^a m? i/ z (2 v /p|5*?/ m |)cos(/(6> m -arg(s*?/ m ))) . 


z=i 


By manipulating p(a; m |# m ) in the same way as in (fl2l> and 
by the orthogonality of the trigonometric functions we obtain 

p(x m , Um\s) = / p(x m \0m)p{0m)lm(y m \0m,s)ddm 

J —TV 

_ exp (—|a; m | 2 - \y m \ 2 - p( 1 + |s| 2 )) 


X | fim,0 + 2 E (3m, l cos (lCm) ■ 


(14) 


z=i 


The result in © follows by substituting ( fldl i in ( ITTb . ■ 
The expression in © holds for any constellation, but can be 
particularized for any choice of constellation. In the following, 
we particularize ([7]) for the phase shift keying constellation 
with N symbols (iV-PSK). This choice will be motivated in 
Section IIII-BI 

Corollary 1: For s selected from an iV-PSK constellation 

r 2 tt • 2tt(JV —1) 1 

as s G n E the likelihood (|7]> can be 

written as 

M 

P (x, y\s = = A JJ / m ,n(arg(j/ m ) - arg(z m )) 


m =1 


(15) 


where / m , n (w m ) = ^ ■ The log- 

likelihood function for the symbol e^~7r is given by 

LR„ = E ln ( /™,o(arg(y m ) - arg(at m ) - —) J . ( 16 ) 

m =1 ' ' 

Proof: Eq. (115b follows immediately for |s| = 1. Eq. (fl6t 
follows by observing that f m>n (u} m ) = f m ,o (w m - ^f). ■ 
Next, we derive the counterparts of Proposition Q] and Corol¬ 
lary [I] for the synchronous operation. 

Proposition 2: The pdf of the received vectors (at, y) given 
a symbol s for the synchronous operation is given by 


p(x, y\s) = A ( 0o + 2 pi cos (IQ 


(17) 


1 = 1 


where 


Pi = aili(2^p\s*l 1 y\)Ii(2Q~p\l r x\), (18) 

C = arg(l T y) - arg(l T at) - arg(s). (19) 

Proof: The likelihood function in this case is given by 

p(x,y\s) = [p(x\6)p(0) [p(y\s,6,<l>)p((l>)d<j)d8. 


The proof follows steps that are similar to Proposition Q] with 
the observation that the channel laws p(x\9) and p(y\s,9,(j)) 
are M-variate complex Gaussian distributions. ■ 

Corollary 2: For s selected from an TV-PSK constellation 
and |s| = 1, the log-likelihood function for the symbol e 3 
can be written as 

LR n = sin irw) sin ( Z ( ar s ( 1 J y) - arg(l T at) - ) . 

1=1 ' ' 

( 20 ) 


A. High SNR Analysis for the Synchronous Operation 

The expressions in Propositions [Q [2] and Corollaries Q] and 
[2] can be easily implemented but do not allow for analytical 
work. Therefore, in this section we present an asymptotic 
analysis as p —> oo (high SNR) for the problem in © and 
©. We start with the synchronous operation as it appears to 
be simpler. The system model for the synchronous operation 
in the case of high SNR can be expressed as 


. 4 * 



JS 


1 


y = —e 3 ( 0+ ^ sl=>y = xe 3 ^s=^- 

Vp 


\x H y\=M\s\ 
p = aig(s)+p 


( 21 ) 


where p = arg (x H y) . From OH it is apparent that the ampli¬ 
tude of s can be decoded error-free. Hence we restrict the study 
to PSK constellations where the signal is impaired by phase 
noise. The observation if) is in this case sufficient statistics. 
We proceed by deriving the asymptotic SER at high SNR for 
PSK constellations. We further specify the pdf in © to be the 
circular normal (also known as von Mises) distribution with 
zero mean and concentration parameter k > 0, p ~ VA4 (0. k), 
which is given by 


Pt>(P) 


COS 4 > 

2irl 0 (n) 


1 

2n 



h(Q 

Io(k) 


cos (If) 


( 22 ) 


for f £ [—7r, 7r). Then t/>|arg(s) ~ VA / l(arg(s), k). By the 

symmetry of the circular normal distribution around its mean, 
the symmetry of the PSK constellations and the fact that we 
assume equal priors for the constellation symbols, the decision 
region for the symbol s n = exp (j^jfQ , n = 0,..., N — 1 is 
[nr ~ 7W TW + Jf )- Then, the probability of error at high 
SNR is given by 


Pr{e} = l -I p^(p)dp = l 



COS 0 

27 t! 0 (k) 


dp. (23) 


Proposition 3: From (l23l > we observe that there is a non¬ 
zero SER floor for the synchronous operation, which depends 
only on the concentration parameter of the phase noise in¬ 
crement, k, and the PSK constellation density, N, but is 
independent of the number of receive antennas, M. 


2 For a fixed argument, x, the value of Iii(x) reduces rapidly as // increases. 
Hence, only a small number of terms is required to approximate accurately 
the detectors in Propositions [7] and EDEI- 










B. High SNR Analysis for the Non-Synchronous Operation 

The system model © and © for the non-synchronous 
operation in the high SNR regime is given by 


Xm = — = e J m 

Vp 

~ A Vrn _ j(6 m +<t> m ) 

ym — c ^ ~^ 

Vp 

4>m = arg {x* m ym) = fm + arg(s). 


(24) 


Similarly to ( EH . the amplitude can be decoded error free also 
in the non-synchronous operation. Hence, we focus only on 
the phase. The likelihood function under the assumption that 
(f> m are independent VA4(0 ,k) random variables is given by 

M e KE"=i«W m -arg( 5 )) 

P*\s{ll>\s) = 11 P9 m \s{^m\s) = - ~ M -• 

m=1 (27r/o(tv)) 


Proposition 4: The scaled log-likelihood ratio for the sym¬ 
bol from an iV-PSK constellation is given by 

LLR a _J_ ( P* | S (^l ai 'g( s ) = ^7r) \ 

" 2 kM \ p*| 8 (¥|arg( fl ) = 0) ) 

(W ) sin (*" - W) ■ <25) 

m =1 

If we denote by e the error event, i.e. the case where the 
detected symbol s is different from the transmitted symbol s, 
then the symbol error probability is given by 

{ N -1 

U {LLR„ > 0 } 

n—1 

The exact calculation of the probability of error in (l26l l 
appears formidable. We therefore derive an upper bound on 
the pairwise symbol error probability of erroneously detecting 
s n = exp (j^p) , n = 1,..., N — 1 when s 0 = 1 was sen{f 
For this purpose we will use the result stated in the following 
lemma. 

Lemma 1: Bernstein Inequality mi Let X m , m = 
1,M be independent and identically distributed random 
variables with E[A m ] = 0, \X m \ < C almost surely for some 
bounded constant C, X s = l ^ m and g = 1k /VAR(A' s ). 

Then for all i > 0 

Pr{A s > N} < exp —■ 

Proposition 5: The pairwise error probability for the de¬ 
tected symbol s n to be s n = exp (j, n = 1,..., N — 1 

3 We note that due to the symmetry of the von Mises distribution around its 
mean and the uniform priors on the input symbols, the conditioning on any 
particular input symbol does not affect the result. The symbol so = 1 with 
arg(so) = 0 is selected for convenience. 


arg(s) = 0 > . 


(26) 


given that the symbol So = 1 was sent is upper bounded by 

' 2 \ ’ 


Pr {LLR n > 0} < exp 


( M ( ; in2( ^ } £44 

^ sJV/\R(X m<n ) Nw 

2 Csin 2 (^) /!(«) 
Z ^ 3 VAR(X m ,„) I 0 (k) 


(27) 


where 

and 

VAR(X„ 


„ a . Ttn\ . 2 ntn\ 7i(/c) 

C = sin — + sin — r . , 
V N ) V N ) Iq(k) 


(28) 


. 2 (Kn\ ( h{n)cas (^) . 2 (nn\( 

(if) ( +am (if) (‘- IJR)j • 

Proof: Let 

v A . /7T n\ . f nn\ . 2 fnn\ 7i(k) 

A„,„ = s,„ {-) sin (fc - ) + .sm 

Then = 0. |X„,„| < C = sin (fi)+sin 2 (m) 

and VAR(X mn ) is given by ( |28| >. Then 

Pr{LLR„ >0} 


= Pr- 




M 

' M 


= Pr < y] x myn > M sin' 


k. m= 1 


2 m\ h{K) 

yNJ 7„(re) 


Define <; = yfM ^/VAR(A' TOjn ) and 

t = -Pm Slir (jgO ^( K ) 
v/VAR(X m>n ) W 

By applying the Bernstein inequality (Lemma© 

( sin2 (^) am V\ 


Pr{LLR n > 0} < exp 


V 


\ v / VAR(X m , n ) z o( K ) 

2 Csin 2 (^) j l(w ) 
+ 3 VAR(X m ,„) / 0 (k) 


/ 


Corollary 3: The pairwise error probability goes to zero at 
least exponentially with M for the non-synchronous operation. 
Also for the SER in d26l > we have liniM->oo (Pr {e} —> 0. 

Proof: We observe that C and VAR(A' m n ) are constant 
with respect to M. Hence, the bound in ( |27| > behaves as 
()(er A/ ). Further, using the union bound |fl9) and Proposition 
[3 the SER can be upper bounded by 

N 

lim Pr {e} < V lim Pr |LLR„ > 0} = 0, 

M—foo ' ^ M—foo 

n= 1 

which establishes the second claim of Corollary [3 ■ 

Hence, in the non-synchronous case the error floor can be 
made arbitrarily small by increasing the number of receive 
antennas. This is in contrast to the synchronous case, shown 
in Proposition |3 where an irreducible SER floor independent 
of M was observed. 






























Fig. 1: The zero mean circular normal (von Mises) distribu¬ 
tion (l22l) for various values of the concentration parameter 
k £ {0, 2, 3, 5, 10}. For n = 0 the distribution is uniform in 
[—7t, 7 r) and becomes more concentrated around the mean as 
k increases. 


IV. Numerical Examples 

In this section we present numerical examples based on the 
theoretical results presented in Section [HI] The phase noise 
increments, <j) rn . m = 1 ,..., M , for the non-synchronous case 
and (p for the synchronous case are distributed as VA1(0 ,k), 
where k is specified for each figure. In Fig. Q] the zero- 
mean circular normal distribution is plotted for various values 
of k £ {0, 2, 3, 5, 10}. For k = 0 the distribution is 
uniform in [— 7r, 7r), whereas it becomes more concentrated 
around the mean as k increases. In Fig. [2] the bit-error-rate 
(BER) performance of the synchronous and non-synchronous 
operation for a BPSK constellatiorQ is calculated via Monte- 
Carlo simulations based on the detectors in Propositions Q] 
and |2] and plotted as a function of p for a fixed number of BS 
antennas, M = 4. Two families of curves are plotted, namely 
for k = 3 and for n = 5. We observe that in the high SNR 
regime the non-synchronous operation exhibits better BER 
performance. The BER floor that appears for the synchronous 
operation depends on the concentration parameter of the phase 
noise increment, n, and is reduced as k increases, which is in 
accordance with Proposition [3] 

In Fig. [3 the BER performance of the synchronous and 
non-synchronous operation with BPSK symbols is plotted as 
a function of p for fixed n = 2. Two families of curves 
are shown, namely for M = 2 and for M = 5. The 
BER performance in the high SNR is superior in the non- 
synchronous case. As in Fig. [2] an error floor is observed for 
the synchronous operation. However, the error floor appears 
to be independent of M, as noted in Proposition [3] The error 
floor appears also in the non-synchronous operation. However, 

4 For BPSK modulation one symbol is one bit, so the BER is equal to the 
SER. 



Fig. 2: Bit-Error-Rate (BER) performance of BPSK modula¬ 
tion as a function of p [dB] for fixed M = 4 and two choices 
of k, i.e. k = 3 and n = 5. 



Fig. 3: Bit-Error-Rate (BER) performance of BPSK modula¬ 
tion as a function of p [dB] for fixed k = 2 and two choices 
of M, i.e. M = 2 and M = 5. 


it is lower than the respective floor of the synchronous case. In 
addition, it is reduced as M increases, as proved in Proposition 
0] Similar behavior has been observed in l9l-|ff3l. Finally, in 
Fig. H] the SER of an 8-PSK system is plotted as a function of 
p for fixed k = 10 and two values of M, M = 3 and M = 5. 
Fig. a extends the conclusions of Fig. 0 to a multi-symbol 
constellation and verifies Propositions [3] and a 

We conclude the discussion on the numerical examples 
by noting that the synchronous operation shows better per¬ 
formance at low SNR under certain circumstances. This is 
particularly the case in Fig. a l' 01 ' the curve corresponding to 
M = 4 and k = 5. On the other hand, for 8-PSK constellation 
both operations exhibit similar performance, as shown in 
Kg. a In the non-synchronous operation the independent 
phase noise components have an effect that is similar to the 


























Fig. 4: Symbol-Error-Rate (SER) performance of 8-PSK mod¬ 
ulation as a function of p [dB] for fixed k = 10 and two 
choices of M, i.e. M = 3 and M = 5. 


additive noise so this component is amplified. However, in the 
synchronous operation phase noise is a common rotation of the 
signal at all antenna elements. When the variance of the phase 
noise increments is sufficiently small and the constellation 
is not dense, then the common phase rotation due to phase 
noise degrades the SER performance more moderately in 
comparison to the independent rotations that appear in the 
non-synchronous operation. However, when the constellation 
is dense, i.e. sensitive to common rotations, this advantage 
of the synchronous operation disappears. This is the reason 
why we observe similar SER performance in the low SNR 
regime of Fig. |4] for the case of 8-PSK. It is clear that 
there is a regime where the synchronous operation is better 
but after a transition region the non-synchronous operation 
exhibits superior performance. The precise calculation of this 
transition region appears to be dependent on many parameters 
such as the SNR, the constellation density and the variance of 
the phase noise innovations, however, it will be considered in 
future work. 

V. Conclusions 

In this paper, the problem of optimal ML detection in 
a phase noise impaired SIMO systems with training was 
considered. Two different operation modes were investigated, 
namely the case of a common oscillator for the whole receive 
array (synchronous operation) and the case of separate os¬ 
cillators for each receive antenna element (non-synchronous 
operation). The optimal ML detection rules were derived 
under a very general parameterization for the phase noise 
increment distribution. For both operations, the asymptotic 
ML detection rules were derived in the high SNR regime 
when the phase noise increments were distributed according 
to a circular normal (von Mises) distribution. An SER floor 
was observed for both operations. The exact expression of 
this floor was derived for the synchronous operation, which 


showed that it depends only on the constellation density and 
the concentration parameter of the von Mises distribution but 
is independent of the number of BS antenna elements. An 
upper bound on the pairwise error probability was derived 
for the non-synchronous operation. This bound was shown 
to approach zero exponentially in M, which implies that the 
probability of error can be made arbitrarily small by increasing 
the number of BS antennas. Numerical examples were shown 
to support the derived propositions. 
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